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Abstract 

This paper focuses on a class of linear Hawkes processes with general immigrants. 

These are counting processes with shot noise intensity, including self-excited and 
externally excited patterns. For such processes, we introduce the concept of age 
pyramid which evolves according to immigration and births. The virtue if this ap¬ 
proach that combines an intensity process definition and a branching representation 
is that the population age pyramid keeps track of all past events. This is used 
to compute new distribution properties for a class of linear Hawkes processes with 
general immigrants which generalize the popular exponential fertility function. The 
pathwise construction of the Hawkes process and its underlying population is also 
given. 

Keywords: Hawkes processes, branching, immigration, age pyramid, non-stationarity, 
laplace transform, thinning, Poisson point measure. 


1 Introduction 


This paper investigates the link between some population dynamics models and a 
class of Hawkes processes. We are interested in processes whose behavior is modified 
by past events, which are self-excited and externally excited. The introduction of a 
self-excited process with shot noise intensity is due to Hawkes (1971) and the famous 
Hawkes process has been used until now for a variety of applications, including 
seismology, neuroscience, epidemiology, insurance and finance, to name but a few. 
The shot noise intensity of the Hawkes process (W) is expressed as 


~ Tn), 

Tn<t 
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where the T„ are the times of jump of the Hawkes process N itself, fi > 0 and 0 is 
a non-negative function. In the Hawkes model, when an event occurs at time T„, 
the intensity grows by an amount (j){t — Tn): this models the self-exciting property. 
Also, for many modeling purposes, (j) returns to zero as t increases, so that the self¬ 
excitation vanishes after a long time. On the whole, each event excites the system as 
it increases its intensity, but this increase vanishes with time as it is natural to model 
the fact that very old events have a negligible impact on the current behavior of the 
process. In the literature, more recent contributions focused on processes with self¬ 
exciting behavior and also some externally exciting component. To our knowledge. 


the Hawkes process with general immigrants has been introduced in Bremaud and 


Massoulie (2002), and specific forms can also be found in recent studies motivated 


by financial applications, such as Dassios and Zhao (2011), Wheatley et ah (2014) 
and Rambaldi et al. (2014), where external shocks, news arrivals and contagion are 
crucial to model. In this paper, we are interested in a class of Hawkes processes 
with general immigrants (see Bremaud and Massoulie (2002)), whose intensity is of 
the form 

At = Kt) + Mt - Tn, Xn) + - Sk, W). 

T„<t Sk<t 

In this model, the Tn are the times of jump of A^: if an event occurs for the system 
at time T„, the intensity grows by an amount — Tn,Xn), where Xn is some 
mark. This part models the self-exciting property. In parallel, external events occur 
at times Sk and excites the system of interest with some amount — Sk, W): this 
is the externally excited component. 

Among the appealing properties of such models, one of them comes from the shot- 
noise form of the intensity. This is called the cluster (or branching) representation 
of the Hawkes process, and it is based on the following remark; if an event occured 
at time Tn, then t — Tn is nothing but the "age" of this event at time t. Few 
years later after the seminal work of Hawkes ( 1971| , Hawkes and Oakes (1974) 
proposed the cluster representation of the self-exciting process. They interpreted it 
as an immigration-birth process with age: they proved that under some stationarity 
conditions, it can be described as a branching Poisson process (also called Poisson 
cluster). Also, in Dassios and Zhao (2011), a definition of a dynamic contagion 
process is given through its cluster representation. Until now, most studies on the 
Hawkes process recalled the immigration-birth representation as follows: immigrants 
arrive at times given by a Poisson process with intensity /i. Then each immigrant 
starts a new generation; it gives birth to new individuals with fertility function 
0, each one giving birth with same fertility function 0. This is often used as a 
definition for the Hawkes process, providing a good intuition on its behavior. The 
cluster representation of Hawkes and Oakes (1974) requires that the mean number 
of children per individual which is nothing but ||0|| = </)(a)da satisfies ||(;/)|| < 1. 
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In our paper, we exhibit the underlying immigration-birth dynamics which does not 
require the stationary assumption. Each individual in the population has an age 
and a characteristic. The virtue if this approach that combines an intensity process 
definition and a branching representation is that the population age pyramid keeps 
track of all past events. This is used to compute new distribution properties for a 
class of linear Hawkes process with general immigrants. 

In the literature, the distribution properties of the Hawkes process have first 


been studied under stationary conditions. Hawkes (1971) addressed second order 


stationary properties, whereas Adamopoulos (1975) derived the probability gener¬ 


ating functional under stationarity, by using the cluster representation of Hawkes 


and Oakes (1974). In this work, Adamopoulos (1975) expressed the probability gen¬ 


erating function as a solution to some functional equation. Furthermore, [Bremaud 


and Massoulie (2002) introduced the framework for studying moments of the sta¬ 


tionary Hawkes process by means of the Bartlett spectrum. Let us also mention 
two recent studies of the distribution properties under stationarity. The moment 


generating function has been expressed in Saichev and Sornette (2011) as a solu¬ 


tion to some transcendental equation. In addition, Jovanovic et ah (2014) proposed 


a graphical way to derive closed form expressions for cumulant densities, leading 
to the moments of the stationary Hawkes process. It is interesting to note that 
such recent contributions rely the stationary branching representation of Hawkes 


and Oakes (1974). Recently, the computation of statistical properties has gained 


attention under non-stationarity, both for mathematical analysis and statistical es¬ 
timation techniques. However, the recent studies in this framework only focus on 
exponential fertility rates 0(f) = ae^^. The tool they rely on is the infinitesimal gen¬ 
erator of the intensity process (At) which is Markovian for such exponential fertility 



paper generalizes these studies in a natural direction for a wider class of Hawkes 
processes. 


Scope of this paper The aim of this paper is (i) to introduce the concept of 
age pyramid for general Hawkes processes and study its dynamics over time, (ii) 
to use this concept to compute new distribution properties for a class of fertility 
functions which generalize the popular exponential case, and (hi) to give a pathwise 
representation of the general Hawkes processes and its underlying immigration-birth 
dynamics. We represent the population as a multi-type dynamics with ages, includ¬ 
ing immigration and births with mutations. Our population point of view that 


introduces the concept of age pyramid is inspired by Bensusan et ah (2010-2015) 


see also Tran (2008)). As highlighted in Bensusan et ah (2010-2015), the key idea 
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is that the population structure in terms of ages and characteristics, which keeps 
track of past events, provides much more information than the intensity itself and 
allows to study the whole system. In this way, we address the computation of dis¬ 
tribution properties of the Hawkes process with general immigrants for a wide class 
of time-dependent fertility functions. We also give the pathwise construction of the 
age pyramid represented as a measure-valued process solution to a stochastic equa¬ 
tion driven by a Poisson point measure, which is the theoretical counterpart of the 
thinning numerical procedure. Our approach seems to reconcile the two definitions 
of Hawkes processes, through an intensity process or a branching dynamics. 

The paper is organized as follows. Section focuses on the standard Hawkes pro¬ 
cess with time-independent fertility function. On this particular case, we give the 
population point of view and study the dynamics of the age pyramid over time. In 
Section we use this concept to compute new distribution properties as moments 
and Laplace transform for a class of Hawkes processes which generalizes the popular 
exponential case. Section [^details the pathwise contraction of the standard Hawkes 
process and its underlying population. Our general population representation and 
results are given in Section where we focus on Hawkes processes with general 
immigrants. In particular, we derive its dynamics and Laplace transform for a wide 
class of time-dependent fertility functions. 


2 Population point of view 


The definition of the (standard linear) Hawkes process through its intensity is given 
below. Let (O, A, P) be a probability space satisfying the usual conditions. Recall 
that the intensity process (At) of a counting process (W) is the (J^/^)-predictable 
process such that W — A^ds is an (Jy^)- local martingale, where (Jy^) denotes 
the canonical filtration of {Nt). 

Definition 1. Let cj) he a continuous and non-negative map. A Hawkes process (W) 
with kernel (f) is a counting process with canonical filtration {Hfi) which admits an 
-predictable intensity 

Xt = p^+'^fiit-Tn) = j 0(f-s)dW, (1) 

T„<i “'(O’d 

where > 0, and the {Tn) are the times of jump of (W)- 


The previous definition provides the representation of the intensity process, 
which is interesting in order to study the behavior of the Hawkes process. But 
in fact, the whole information on the dynamics is lost. Indeed, it is interesting to 
go back to the branching representation of Hawkes and Oakes (1974) to have in 
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mind the underlying population dynamics. First, immigrants arrive according to 
some Poisson process with parameter /i. Then each immigrant generates a cluster 
of descendants with the following rule: if an individual arrived or was born at some 
time s, it gives birth to new individuals with rate (j){t — s) at time t. Note that in 
fact, t — s is nothing but the age at time t of the individual born at time s. The birth 
mechanism can thus be reformulated as: any individual with age a in the popula¬ 
tion gives birth with rate 0(a). The whole dynamics describes an immigration-birth 
process with age, in which the immigration rate is /i and the birth rate is 0(a). 

Since the immigration-birth mechanism is crucial to understand the Hawkes 
dynamics, the aim now is to keep track of all ages in the population. One way 
to address this issue is to count the number of individuals with age below a > 0 
at time t, denoted Zt{[0,a]). This can be computed as the number of individuals 
arrived until time t without those arrived before t — a, that is 

■^t([0, a]) = Nt — Nt-a = [ lf_s<adiVs. 

J{0,t] 

The previous equation shows that with hxed t, this dehnes a measure on the space 
M+ of ages that is an image of the jump measure dNt. It can be written as 

p Nt 

Zt{da)= 6t-s{da)dNs = '^6t-T^{da). (2) 

Note that Zt{da) charges only [0,0 since no individual born after time 0 can reach 
an age greater than t. Formally, the measure .^t(da) puts a weight on the age of each 
individual alive at time t, therefore we call it age pyramid in reference to demographic 
analysis. In general, demographic studies focus on the number of individuals per age 
class of e.g. one year, so the quantity of interest is e.g. ^t([a, a -|- 1)). The virtue of 
the measure representation is that one can compute a function / of the population 
age structure by integrating it with respect to the age pyramid. To do this, we use 
the notation 

{ZtJ)= [ f{a)Zt{da)= [ /(t-s)diV,. (3) 

Jm+ 

For example, the Hawkes process can be computed as = {Zt,l). Also, the 
intensity process dehned in Equation Q can be rewritten using ([^ as 

= h + 0)- 

The intensity is the sum of the migration intensity /i and the individual birth inten¬ 
sities: this is indeed the intensity of an immigration-birth process with migration 
rate /r and birth rate 0(a), in which all individuals behave independently. Viewed 
as a stochastic process, (Z0da))t>o is a measure-valued process. In fact, this age 
pyramid process, that is the measure-valued process {Zt{da))t>o, is a Markov process 
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(see Tran (2006)). Note however that its differentiation in time is not straightfor¬ 
ward (see Bensusan et ah (2010-2015) and Lemmabelow). The Markov property 
of the age pyramid process shows that all the information needed is contained in the 


population age structure. Let us mention the seminal point of view of Harris (1963), 


for who "it does seem intuitively plausible that we obtain a Markov process, in an 
extended sense, if we describe the state of the population at time t not simply by 
the number of objects present but by a list of the ages of all objects." However, in 
practice this information is "too large" to perform tractable computations. In the 
next Section, we illustrate how to identify some minimal components to add to the 
Hawkes process in order to make the dynamics Markovian. To do this, we first need 
to address the time evolution of the age pyramid. The following Lemma details the 
dynamics of {Zt, f) in the case where / is differentiable. This is the key tool for our 
results in Section |3l 

Lemma 1. For each differentiable f : 


-)■ 


(Z„/)=/(0)(Z„l)+ / {Z,J')ds. 

Jo 


(4) 


u 


Proof of Lemmaj^ Let us write between s and t, f{t — s) = /(O) -1- J* f{u — s)d 
and use it into Equation (3) to get {Zt, f) = f{0){Zt, 1) -|- /q ^ f* f'{u — s)dnj diV^. 
By Fubini’s theorem, the last term of the sum is equal to f* f'{u — s)dNs) du, 
and by Equation (3), this is equal to f* (Zu,f'}du. This concludes the proof, o 


The decomposition (|^ is classical in the field of measure-valued population dy¬ 


namics see 


Tran (2008) and Bensusan et ah (2010-2015)). The first term refers to 


the pure jump part of arrivals of individuals with age 0, whereas the second term of 
transport type illustrates the aging phenomenon; all ages are translated along the 
time axis. In particular, this shows why the intensity process At = /u -h {Zt-,<p) is 


Markovian in the case where the fertility function is exponential (see Oakes (1975)), 


that is 0(a) = Q;e^“. In this case, 0' = 00, and Equation (|^ with / = 0 leads to 
the differential form 

d{Zt, 0) = adNt + (3{Zt, 0)dL 

Note that dNt only depends on the past of (A0 by means of the current value At, 
which proves the Markov property. This remark is the starting point of our study, 
which extends the exponential case in a natural setting. 


3 The exponential case generalized 

In this section, the aim is to use the concept of age pyramid process introduced in 
Section in order to compute several distribution properties for the non-st at ionary 
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3.1 Assumption on the birth rate 


Hawkes process. In particular, we provide ordinary differential equations for first and 
second order moments and the Laplace transform. All computations are performed 
under some assumption on the birth rate 0 which naturally extends the popular 
exponential case. 


3.1 Assumption on the birth rate 

Assumption 1. The map a G M+ e-)■ 0(a) is non-negative, of class and 

there exists c = (c_i, ...,c„_i) G such that 0 statisfies 

n—1 

0(")=c_i + 5^Cfc0(^), (5) 

k=0 

with initial conditions 0*^^^(O) = m^, for 0 < k < n — 1. 


The birth rates that satisfy Assumption include the exponential case but also 
some fertility functions that are interesting for a variety of applications. Let us 
introduce the vector 

m = (l,mo, (6) 


and the matrix C = (Ci,j)-i<i,j<n-i given by = lforO<i<n — 2 and 

Cn-ij = Cj for —1 < j < n — 1, all other components being zero. Since it is fully 
determined by the vector c, we denote 


/ 0 


C{c) = 


0 

0 


Vc -1 Co 




0 1 
Cn— 2 C ^_Ij 


(7) 


Equation ^ can be rewritten where $ = (1,0,..., 0*-”'“^^)^, whose 

solution is given by ^(a) = Then 0 can be recovered as the second component 

of the matrix ^(a). In particular, if the polynomial P{y) = — YltZo ^ky^ is split 

with distinct roots yi, ...,yp and corresponding multiplicity ni, ...,np, then 0 can be 
written up to some constant as where Pi is a polynomial with degree 

at most Ui — l. This is a sufficiently large set of functions to approximate any fertility 
function outside of the range of Assumption!^ As an example, the power law kernel 
is of importance for many applications. In the context of earthquakes, the Omori law 
describes the epidemic-type aftershock (ETAS) model; it corresponds to a specific 
form 0(a) ~ Also in the field of financial microstructure, recent studies (see 

e.g. Hardiman et al. ( 2013[ )) found that high-frequency financial activity is better 
described by a Hawkes process with power law kernel rather than exponential. The 
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3.2 Dynamics 


power law kernel with cut off can be approximated as in Hardiman et al. (2013) up 
to a constant by the smooth function 


g-a/(Tomp 






i=0 


where S is such that 0(0) = 0. In general, one can use approximation theory to 
construct a sequence of fertility functions which tends to the original one. As a 
result, this constructs a sequence of Hawkes processes that approximate the original 
Hawkes process. 


3.2 Dynamics 

Let us go back to the dynamics of the age pyramid over time. The key property 
that will allow us to compute distribution properties is that the population enables 
to identify the components to add to the Hawkes process and its intensity to make 
the dynamics Markovian. This is stated in the following proposition. 

Proposition 2. Under Assumption^ the process Xt = {{Zt, 1), •••, 

satisfies the dynamics 

Xt = Nm+ [ CXAs, ( 8 ) 

Jo 

where we the vector m and the matrix C are given in ^ and 0 respectively. In 
particular, X is a Markov process. 

Proof of Proposition Let us use Lemma to get for 0 < fc < n — 1, with 

/ = 

{Zt, = mM + [ {Zs, 0('=+'))ds. (9) 

Jo 

By Assumption we get in particular 

n-l 

{Zt, + V Cfe / {Z„ 0W)d5, (10) 

k=-i 

with convention 0^“^) = 1. This implies the dynamics ^ which also shows that X 
is a Markov process, o 

The dynamics ([^ for the (n + l)-dimensional vector Xt gives a set of n equations, 
the hrst component of X, which is the Hawkes process N, being free. In Section|^we 
will give an equation on the Hawkes process N by means of stochastic representation 
based on Poisson point measures. This will provide a full system of equations for the 
components of X as well as a pathwise representation. For now, we are interested 
into deriving several distribution properties of the Hawkes process and its additional 
components in X. 
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3.3 Moments 


3.3 Moments 

First order moments The differential system of Equation (|^ is linear and allows 
us to propose a straightforward differential equation for the first order moments. We 
also perform explicit computations for small dimensions n = 1 and n = 2. 

Proposition 3. Under Assumption^ the vector map u{t) := E [Xt] is solution to 

u\t) = p.m + Au(t), ( 11 ) 

where the (n + 1) x (n + 1) matrix A is given by 

A = C -h mJ, (12) 

where 

J=(0,1,0,...,0), (13) 

and the vector m and the matrix C are given in & and & respectively. 


Proof of Proposition Let us use the martingale property of the compensated 
counting process, then use Fubini’s theorem and the fact that Lebesgue measure 
charges no point to get E [Nt] = Jq* (/i + E[(Zs, 0)]) ds. Now, let us take expectation 
in ' ' 


and use the previous formula to get Equation (11). o 

The differential equation ( [IT| ) allows to get explicit formulas for the expected 
number of events. We recall the first order moment for the popular exponential case 
0(a) = (see e.g. Dassios and Zhao (2011)) and also give the explicit formulas 
for the birth rate 0(a) = a^ae~^°‘. Note that this case can be useful for a variety 
of applications to model a smooth delay at excitation. Remark also the different 
behavior of the first order moments, in particular in the critical case 0(a)da = 1, 
which corresponds to c = 1 and a = (3. For the two examples given below, the 
computations are left to the reader. 


Corollary 1. For the Hawkes process with (j){a) = e c > 0, (n = 1 m Assumption 

0. 

E|iV,] = (i fi + 0 s/c=l, 

Li f — 1 \ 

E|A',| = 00( -ctf 

Corollary 2. For the Hawkes process with 0(a) = a^ae~^°’, a,/3 > 0, (n = 2 in 
Assumption^, 


E[W] 

E[W] 


I, (1+ fit = A 

/i02 a/i - 1 

02 ^ T V (a - 0)2 (a + 0)2 ) 


if a ^ 13. 
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3.3 Moments 


Second order moments In this subsection, we derive the dynamics of the variance- 
covariance matrix of the process := (Nf, (Zt, (Zt, As a conse¬ 

quence, we represent second order moments as the solutions to a linear ordinary 
differential equation. Our method is based on differential calculus with the hnite 
variation process (Xt) with dynamics (|^ and could be extended to higher moments. 

Proposition 4. Let us introduce the variance-covariance matrix Vt = XtXt, where 
Xt denotes the transpose of Xt. Then the matrix Vt satisfies the dynamics 

dVt = dNt [Xt-fh mXt- -|- mm) dt (VtC CVt) . 

In particular, the matrix v{t) = E [Vt] satisfies the following ordinary differential 
equation 


v'{t) = v{f)A + Av{f) + p{mm + u{f)m + mu{f)) + Ju{f)mm. 
where u{t) is solution to 0 and the matrix A is defined in ([7^. 


(14) 


Proof of Proposition]^ Let us use the notation Xt = {x\ ^\xf \ ...^x'fi' 
Integration by parts leads to, for —1 <l,k<n — l, 

d (xf = Xfldxf + X]!dxf + mkmidNt. 

The previous equation shows that dVt = Xt_dXt -|- (dXt)Xt_ -|- dNt.mfh. By Propo¬ 
sition and since Lebesgue measure charges no point we get 

dVt = dXt (Xt_rh -|- mXt- + mm) dt (VtC -|- CVt) ■ 


Recall that u{f) = E [Xt]. Now, take expectation in the previous equation to get 


v'it)=E (/i + xf)Xt 


m-|-mE 


(/i + Xf')Xt + (/i + E[ Xf ]) mrh+v{t)C+Cv{t) 


Finally, note that xf'Xt = VtJ where we recall that J is dehned by J = (0,1, 0,..., 0), 
which makes the previous equation reduce to (14). o 


We give explicit formulas for the popular exponential fertility function ^(a) = 
g-ca ^ higher order for the case ^(a) = which corresponds to the 

critical case since the mean number of children per individual satisfies f>(a)da = 
1. Computations are based on the differential equation (14) and are left to the 
reader. 


Corollary 3. For the Hawkes process with 4>{a) = e ^ (n = 1 in Assumption^, 

Var(Xi) = pt(^l + M + if c = 1, 

AC - 1 


Var(W) = 


h 

(l-c)= 


1 c/2 ^2(l-c)t 

1 — c 


1 — c 


— 2ct 


M-c)t _ g 3 ^ c(l/2 3c) 

1 — c 


i/c ^ 1. 
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3.4 Laplace transform 


Corollary 4. For the Hawkes process with (j){a) = (3^ae (n = 2 in Assumption 
the variance of the intensity is given by 


Var (At) = /3p 


7 3 / 3 ^ ^ I 1 ^-2 yt _ _^g-4/3t 


+ —t + —r + 
128 32 16 


128 


3.4 Laplace transform 


The aim is to exhibit the exponential martingale associated with the process X 
which consequently expresses its Laplace transform in a semi-explicit form. This 
is given in the following proposition. It is interesting to note that point (i) refers 
to some forward martingale, whereas point (ii) focuses on backward martingality. 
Note that based on the Laplace transform, it is classical to recover moments of any 
order. 

Proposition 5. Let us denote {fF^) the canonical filtration of the process X and 
let us work under Assumption^ 

(i) For any deterministic and differentiable At, the following process is an (Jq 


■X'l 


(15) 


martingale: 

expl^.Xi- [ As.{CXs)ds- [ AfXAs- [ - l)A,ds 

I Jo Jo Jo 

(ii) For any {n + 1) real vector v, 

E [exp {v.Xt)] = exp ^ (1 “ e"^^-™)ds^ , 
where the vector map A satisfies the following non linear differential eguation 

C'4t + Al( + (e^*-”^-l)J = 0, (17) 

with terminal condition At = v. Here, v.Xt denotes the scalar product between v 


(16) 


and Xt, J is defined in (13), and C is the transpose of the matrix C. 
(hi) 


Moreover, there exists a unigue solution to Eguation 


Proof of Proposition!^ The exponential formula states that the following process 
is a martingale, for any deterministic ag, 


exp I j agdNs - J (e"" - l)A^ds 
Now, by integration by parts and the use of Equation 


At.Xt= [ 4,.dX,+ [ 4'.X,ds 
Jo Jo 

= f As.mdNg+ f 4,.(C'X,)ds+ [ 4',.X,ds. 
Jo Jo Jo 


(18) 
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Then by Equation (18) with = As.m, the process in (15) is a martingale. To prove 
the second point, the aim is to find a martingale of the form exp{At.Xi; +D{t)} with 
some deterministic D{t) and a terminal condition At = v. To do this, let us choose 
A such that the random part in the integrant in (15) vanish. Since = /i+ {Zs-., 4>), 
this amounts to get for each vector X = ...., 

CAfX + A[.X + - l)(/i + = 0. (19) 

Let us now identify the term in X, leading to the equation for A: CAt.X + A'^.X + 
_ x)x[°l = 0, that is CAt + — 1)J = 0, where J is defined in 

(13). If we set terminal condition At = v, we get Equation (20). Finally, existence 
and uniqueness for Equation 0 arises from Cauchy-Lipschitz theorem, since the 
map Y I—)■ CY + (e^'™ — 1) J is of class on and thus continuous and locally 

Lipschitz. o 


The previous result can be expressed in terms of a single function, and is derived 
below for the Hawkes process and its intensity. The proof is given in Appendix. 

Corollary 5. Under Assumption^ the joint Laplace transform of the Hawkes pro¬ 
cess and its intensity is given for each real 9i and by 


n—1 


E [exp {O^Nt + 02 At)] = exp -p 


n, 


k=0 




( 20 ) 


where the function G satisfies the non-linear ordinary differential equation: for each 
0<t<T, 


n—1 


n—1 


+ exp I 6i — c_iG(f) + kG^^+^\t) -1 = 0 , 


k=0 


k=0 


with terminal conditions = 0 for 0 < k < n — 1 and G^"'\T) = (— 1 )"' ^ 6 * 2 , 

( 21 ) 


and for 0 < k < n - 1, bk = (- 1 )*" {mn-i-k - 


k-\-l ^^n—l—lCn—l+k) • 


4 Pathwise representation of Hawkes population 

The aim of this Section is to detail the pathwise construction of the Hawkes process 
and its underlying population. This is done by means of stochastic differential 
equations driven by Poisson point measures. The virtue of this approach is that it 
seems to reconcile both the intensity process definition of the Hawkes process and its 
branching representation. We first describe the construction of the Hawkes process 
with reference to a Poisson measure, then exhibit the system of equations driving the 
generalized exponential case, and finally address the pathwise population dynamics 
for general birth rates. 
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Construction of the Hawkes process One question which arises with Dehnition 
[prefers to the construction of such process and the notion of pathwise uniqueness. 
An answer can be given by the thinning representation, 

which works as follows. Consider a Poisson point measure Q(ds, d6') with inten¬ 
sity measure ^(ds, d^) = dsd6' on M+ x M+ (see e.g. Qmlar (2011) for a definition), 


and denote the canonical filtration generated by Q. Note that the intensity 

measure q is not finite and only a-finite, which makes it impossible to order the points 
in time of the Poisson point measure Q. But this fiexible representation allows to rep¬ 
resent a wide class of counting processes. Let (At) be a (Jq^)-predictable process such 
that a.s. for each t > 0, f* A^ds < -)-oo. Then the following process (N^) is a count¬ 
ing process with (Jy^)-predictable intensity At: Nf = lfQ^)^^j(0)Q(ds,d0). 

Indeed, N is clearly a counting process because each atom of Q is weighted 1 or 0. 
Also, since a.s. f* A^ds < -|-cxd, the martingale property for Poisson point measures 
ensures that Nt — Jq l[o,As](^)d6'ds = Nt — A^ds is a (Jq^)- local martingale. 
Now, let us describe the construction of the Hawkes process. Since the intensity in 
([^ is given as a particular form of the process itself, the idea is to define the Hawkes 
process as the solution to the stochastic equation 


</>(s-u)dAf„] (^)Q(ds, d0). 


( 22 ) 


General results about existence and uniqueness for the Hawkes process (even non- 


linear) can be found in 

Bremaud and Massoulie 

(|1996 

) and 

Massoulie (1998|) (see 

also Delattre et ah 

(2014 

) and the books of Daley and Vere-Jones 

(2008 

) and Qmlar 


(2011)). The thinning method to represent a counting process as the solution of 
a stochastic equation is in fact classical. This general mathematical representation 


goes back to Kerstan (1964) and Grigelionis (1971). One often refers to the thinning 


algorithms that have been proposed by Lewis and Shedler (1978) and Ogata (1981), 


which are very useful to perform numerical simulations for quite complex intensity 
processes. A first advantage of the thinning formulation arises when one wants to 
show the existence of the Hawkes process. This is done by Picard iteration method 
(see Massoulie (1998)): one constructs a sequence {N’^)k>o of counting processes 


starting at = 0, and for A: > 0, 


= 


(o,t] 


l[0,/i+/ju n)dVfc] (^)Q(ds, d0). 


(23) 


One can show that the sequence (N^) is Gauchy and thus converges to the desired 
process. Moreover, another advantage is to give strong uniqueness. With this issue, 
it appears that the thinning representation has the virtue to use "one noise once for 
all" and thus give pathwise construction and results. This is interesting to note that 


this approach is used by Delattre et ah (2014) to show existence and uniqueness of an 
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infinite graph of interacting Hawkes processes. Due to the pathwise representation 
and the iterative construction, one can also identify each generation in the dynamics. 


Indeed, one sees in the construction of the Cauchy sequence in (23) that counts 


the number of immigrants, whereas counts the children of immigrants, 

the grandchildren of immigrants, and so on. Generally, is 

the number of individuals in generation k born before time t. This shows another 
advantage of the pathwise construction: what is called the "thinning parameter" 
9 gives additional information on the dynamics, making it possible in particular 
to study each generation separately. Before giving the representation of the age 
pyramid, we first go back to the extension of the exponential case. 

Exponential case generalized We first address the particular case where the 
birth rate 0 satisfies Assumption The dynamics of the {n + 1)—dimensional 
vector Xt := (W, (^t, </>), ■■•, (^t, is given in ^ by dX* = dNtm + CXtdt. 

This gives in fact n equations, the first coordinate Nt being free. The pathwise 


representation (22) allows to derive the full system of equations by 


dw= / mlr [on(0)g(df,d0) + GWdf, 

J R_l_ L J 

where we recall the notation X* = (x]~^',x]°^, ...,x]”~^'). 

Immigration-birth process with general fertility functions In the case where 
the birth rate is general, one has to represent the whole age pyramid, that is to give 
the thinning representation of the underlying immigration-birth process. In the field 
of population dynamics, this approach is used to construct extended birth-death pro¬ 


cesses with age in particular in Bensusan et ah (2010-2015) (see also Fournier and 


MG^rd (2004) and Tran (2008)). From Equations (|^ and (22), we get the pathwise 
representation 


Zt{da)= / lio,f,+{Zs-M{0)^{t-s){da)Q{ds,de). 

Jr+ 


(24) 


This illustrates the fact that the population at time t is nothing but all individuals 
that arrived before time t (immigration or birth); if an individual arrived at time 
s, its age at time t is t — s. Note that in this form, the differentiation is not 
straightforward (see Bensusan et al. (2010-2015)). But from Lemma[^ one can write 
the following (infinite) system of equations: for each differentiable / : M+ —)■ M, 


A{Zu f) = /(O) / l[„,;,+(z,_.<,),(9)Q(di, dff) + {Z„ f')dt. 


(25) 


This approach seems to reconcile the intensity process definition of the Hawkes 
process with its branching representation. Indeed, the population age pyramid is 
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given through Equation (25) as a stochastic measure-valued process with its own 
intensity. Before going through the last Section on the Hawkes process with general 
immigrants, we briefly discuss the existing cluster representation in the following 
remark. 


Remark 1. We recall the definition of the Hawkes process in terms of a Poisson 


cluster introduced in Hawkes and Oakes (1974) surveyed in the book of Daley 


and Vere-Jones { 200^ . Let Nc{ds) he a Poisson point measure on M_|_ with intensity 
measure fads: this defines the cluster centers, also called ancestors. Let us introduce 
a family of point processes {iV(df | s),s G M+j. For each s, N{dt \ s) defines 
the location of the offsprings within the cluster of an ancestor located at s. The 
cluster process N counts the number of all offsprings of all immigrants by N{dt) = 
Jg N{dt I s)Nc{ds). That is, the number of all offsprings up to time t is given by 


N{[0,t])= iV([0,f] I s)iV,(ds). 


Thus in the cluster representation, the Hawkes process can be written as the sum of 
the immigrants and their offsprings by 

N,{[0,t]) + N{[0,t]). 


Note that the cluster representation has shown to facilitate the study of the 
Hawkes process under stationarity by using results in the field of branching pro¬ 
cesses. Our population representation seems to be the non-stationary counterpart, 
as it allows us to derive new distribution properties in this framework. Our popula¬ 
tion representation provides not only the size of the total progeny up to time t, but 
also a variety of guantities of interest depending on the population age structure. 
This has been used in Section\^ in order to identify the components needed to make 
the dynamics Markovian. This will be also used in the following Section to study a 
class of Hawkes process with general immigrants. 


5 Towards more general Hawkes processes 


In this Section, we focus on a class of counting processes Nt named as Hawkes 
processes with general immigrants (see Bremaud and Massoulie (2002)), which is 
dehned below. 


Definition 2. A Hawkes process with general immigrants is a counting process Nt 
whose intensity is given by 

Xt = /i(f) + Mt - T„, X„) + dit{t - Sk, n), (26) 

T„<t Sk<t 
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5.1 Description of the two-population dynamics 


where the are the times of jump of N, the Sk are the jumps of a counting process 
with deterministic intensity p{t) and the (resp. Y^) are real positive iid with 
distribution G (resp. H). The {Sk), iYk) and (X„) are assumed to be independent 
of each other. 


In this model, the T„ are the times of jump of Np. if an event occurs for the 
system at time Tn, the intensity grows by an amount ^t{t — Tn,Xn), where Xn 
is some mark. This part models the self-exciting property. In parallel, external 
events occur at times Sk and excites the system of interest with some amount 
— Sk,Yk): this is the externally excited component. The standard Hawkes 
process that have been studied in the previous Sections can be recovered by setting 
<ht(a, x) = 0(a) and \hf(a,x) = 0. The Hawkes process with general immigrants 


has been introduced and studied under stationary conditions by Bremaud and Mas- 


soulie (2002). Due to their flexibility and natural interpretation, such models have 


gained recent attention for financial applications e.g. by Dassios and Zhao (2011), 


Wheatley et ah (2014) and Rambaldi et ah (2014). In particular, distribution prop¬ 


erties of such process have been investigated by Dassios and Zhao (2011) in the case 
<I>t(a,x) = Tf (a, x) = xe in which framework the intensity process is Markovian. 
The aim of this Section is to study the dynamics and characterize the distribution 
of the non-st at ionary Hawkes process with general immigrants for a larger class 
of fertility functions, possibly time-dependent, which extends the previous work of 


Dassios and Zhao (2011) in this direction. To do this, we represent a two-population 


immigration-birth dynamics with ages and characteristics. 


5.1 Description of the two-population dynamics 

The aim is to construct populations of several individuals (or particles), each one 
having an age a evolving over time, and a characteristic x G M+. We construct 
two populations; the first one represents external shocks, whereas the second one 
represents events for the Hawkes process. 

Each population (z), z = 1 or 2, is represented at time t as a measure which puts a 
weight on the age and characteristic of each individual, denoted Zj*^(da, dx). The 
two populations are introduced based on Definition as 

Zf^(da,dx) = ^ 5(i_sj^,y^)(da,dx) and zf^(da,dx) = hp_r„,x„)(da, dx). 

Sk<t Tn<t 

(27) 

Since ages but also characteristics of individuals are involved, we prefer to call 
population structure rather than age pyramid^ which is more specific. As for the 
standard Hawkes population representation, one can compute functions of the whole 
population structure, which can even depend on time. Consider a function ft{a,x) 
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5.1 Description of the two-population dynamics 


depending on time, and also on age and characteristics of individuals. This can be 
computed on the overall population by 

{Zi'\ft)=[ ft{a,x)zl'\da,dx), (28) 

t/ M_|_ X IR._p 


I 9 I I 9 I 

for i = 1 or i = 2. For example, the Hawkes process is Nj: = (Z) ^ 1). Also, the 

/o\ 

intensity Xt of the Hawkes process given in Equation (26) can be rewritten as 


This shows that the underlying population dynamics works as follows. 

(i) Let us first describe the population (1) of external shocks. It is made with 
immigrants that arrive in population (1) with rate p{t)] at arrival, they have age 0 
and some characteristic x drawn with distribution H. Any individual (a, x) at time 
t that belongs to population (1) gives birth with rate d/t(a, x). The newborn belongs 
to population (2); it has age 0, and some characteristic drawn with distribution G. 

(ii) Let us now complete the description of population (2). In addition to births 
from population (1), the population (2) evolves according to two other kind of events: 
immigration and internal birth. Immigrants arrive in population (2) with rate ju(t) 
with age 0 and a characteristic drawn with distribution G. Any individual (a, x) at 
time t that belongs to population (2) gives birth with rate <hi(a,x). The newborn 
also belongs to population (2); it has age 0, and some characteristic drawn with 
distribution G. This dynamics is illustrated in Figure 1. 

As for our analysis of the standard Hawkes process, a crucial step is to study the 
dynamics of the population structure over time. That is, what is the dynamics of 
the process {z‘f\ ft) for z = 1 or 2 ? This is stated in the following lemma. 

Lemma 6. For each function f : {t,x,a) h-)■ ft{a,x) differentiable in t and a, the 
dynamics of the process {z[^\ ft) for i = 1 or 2 is given by 

d{Z,®./.>= f /i(0,i)Af<''(d(,di) + (Z®,(a„+S.)/.)d(, 

where the point measures A^9) j\r(2) ly 

N^^\dt,dx) = ^(5(5j^,y^)(dt,da;) and N^‘^\dt,dx) = ^ 5(r„,x„)(df, dx). (29) 

k>l n>l 


Proof of Lemma The proof is a straightforward adaptation of that of Lemma 
using (27) and (28) together with the fact that 


ft{t - s) = fs{0) + / (9a + dt)fu{,u - s)du.o 


Analogously to Lemma [T| this result exhibits the pure jump part in the left-hand 
side, whereas the drift part illustrates the aging term and the time-dependency. The 
fact that the drift depends on both daft) and {zlf \ dtft) is the starting point 

of our results derived in what follows. 


17 







5.2 Main result 


t 


y 




Figure 1; Dynamics of the immigration-birth process: immigrants arrive in pop¬ 
ulation (1) (external shocks). Then each individual 1 gives birth to individuals 
2 (events due to external shocks). In parallel, immigrants arrive in population 2 
(events due to the baseline intensity). Finally, each individual 2 reproduce (self¬ 
excitation). The Hawkes process with general immigrants can be recovered as the 
number of individuals 2. 

5.2 Main result 

In the following, we introduce the assumptions allowing to recover a hnite dimen¬ 
sional Markovian dynamics. 

Assumption 2. (i) The birth rates <I> and T are non-negative and satisfy <I>i(a, x) = 
v{t)(j){a,x) and \ki(a,x) = w{t)'il!{a,x), where 

n—l p—1 

x) = c_i -|- Ck4>^'^\a, x) and v^\t) = d-i{f) di{t)v^^\t), 

k=0 1=0 

with n,p>l and initial conditions (j)^^\0,x) = 0q^^(x), and 

m—l q—1 

x) = r_i ^ rkip^’^\a, x) and w^‘^\t) = k_i{t) ^ ki{t)w^''\t), 

k=0 1=0 

with m,q > 1 and initial conditions 'if^’^\0,x) = 'ijjQ^\x). Note that we used the 
notation x) = da’^f{a,x). 

(ii) The maps ((i/)_i<Kp_i and {ki)_i<i<q_i are continuous. 

Remark 2. A ssumption\^ defines a wide class of self and externally exciting fertil¬ 
ity functions of the form <Fi(a,x) = u(f)0(a, x). Let us first focus on the time- 
independent part and introduce F{a,x) such that F = (1, 0,..., 0*^"'“^^)^. Then 
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5.2 Main result 


F' = CF, where C is defined in 0. In particular, if the polynomial P{y) = 
y"^ — with distinct roots yi,...,yp and corresponding multiplicity 

ni,...,np, then can he written up to some constant as where Pi 

is a polynomial in a with degree at most n* — 1 whose coefficients may depend on x. 


This includes the framework of Dassios and Zhao (2011) where $t(a, x) = x) = 
xe~^°‘. As we also allow for time-dependency, such birth rates $ and that satisfy 
seem also useful to define non-stationary Hawkes processes, and in particular to 
include seasonality. As an example, one can simply think of a kernel of the form 
cos^{at) f){a, x) where v{f) = cos^(at) satisfies v" = 4q;^(1 — v). 


The aim of this part is to exhibit some exponential martingale which leads us to 
compute the Laplace transform of the whole dynamics. This provides in particular 
the joint Laplace transform of the Hawkes process with general immigrants and its 
intensity. This is the main result of our paper. We hrst state the following Lemma. 

Lemma 7. Let us define for —l<k<n — l and —1 < / < p — 1, : = 

d(^t) and for—1 <k< m—1 and—1 < I < q—1, := {zj:^\ 

Let us also define the two matrices 

andM^^^ = 

(i) Let US recall that D denotes the transpose of a given matrix D. The processes 
MA) and follow the dynamics 


dMf^ 

where 


WA{t,x)NA{dt,dx) + , 


(30) 


• Adl^i{t,x) = wA{t)ijjA\x) for —l<k<m — l and —l<l<q — l, 

• Wl^^i{t,x) = yA{t)(l)A\x) for —l<k<n — l and —l<l<p — l. 


• CA = C{r), CA = C{c), = C{k{t)) and Dfi’ = C{d{t)) where C{.) is 

defined by Equation 0. 

(ii) As a consequence of the dynamics (30), is a Markov process. 


t>o 


Proof of Lemma We focus on the dynamics of the the problem being 
the same for the From Lemma for 0 < k < n — 2 and 0 < / < p — 2, 

dX’(’^ = vA{t) j 0[,^)(a:)X(2)(dt,da;) + (Xf+^’'+ Xf’'+^)dL (31) 

J M_|_ 


19 
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From Assumption 1^ X”’* = and Xf’"" = ■ This 

shows that for 0 < / < p — 2, 


n—1 


dX”"^’' = v^\t) / dx) + ^ CfcXf’' + X”"^’'+^ dt, (32) 

^IR+ \A:=-1 

and for 0 < A; < n — 2, 

dx^-p-i = f (^l^^\x)N^^\dt,dx) + [xf+^’^"^ + ) dt. (33) 


i=-i 


and also that 


n—1 


P-1 


dxn-i,p-i ^ / 0(-i)(2.)jv(2)(dt, da;)+ + 5^ di{t)X^-^’^ df, 

'^®+ Va:=-1 l=-l J 

(34) 

In addition, Lemma gives for 0 < / < p — 1, 

dX“^’' = r;d)(t)dxf ^ + X“^’'+Mf. 

and for 0 < k < n — 2, 

dXf’“^= [ 0i^^(a:)X(2)(df,da;)+Xi^+^’“Mf. 

t/ M_|_ 

Finally, by Assumption again, we get the following two equations: 


(35) 


(36) 


'p-i \ 

5^ d,(t)xr'-‘ d«, 

\i=-i J 


and 


n—1 


dX,"“‘'"' 


d'o" ^^(x)Af*^*(dt,dx) + j ^2 ^ ) di. 


(37) 


(38) 


^fc=-l 


From Equations (31) to (38), one then deduces the dynamics (30). o 

To ensure tractability of the Laplace transform derived in the following Theorem, 
we also state the following assumptions. 


Assumption 3. For each A > 0, 

exp I A max (t>^\x) ) G(x)dx < +cxd. 

V 0<A;<n-l J 

Our main result is stated below. Note that the trace of the matrix uM given 
by Tr('uM) = Y2k i computes a linear combination of the components of a 

given matrix M, and recall that u denotes the transposition of the matrix u. 
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Theorem 8. Let us denote the canonical filtration generated by 
Under Assumption\^ 

(i) For any deterministic and differentiable matrix-valued and with 

derivatives and [A^^'^), the following process is an -martingale: 


2 = 1 


exp <1 Tr ~ j + AfM'fi^Df + AfM^'^J ds 

Fr^As w( Hs,x)) _ 1 ^ p{s)H{x)dxds (39) 




Tr A 


,( 2 ) 


{p{s) + mW[ 0,0] + Mf)[0,0]) G(a;)dxds|. 


(ii) For each matrices u and v with dimensions (n + 1) (p +1) and (m + 1) (g +1) 
respectively, the joint Laplace transform can be expressed as 

= exp 


E 


exp i^riuM^^'^ + 




TrU: 


e 

( 2 ) 


Tr(^A„ ds,x)^ — 1 ^ p(s)if(a;)da;ds 


w( ^(s)G(x)dxds|, 


(40) 


where 

for t G {1, 2}, = 


1 — e 


Tri A 


i(2) 




with terminal conditions 


= u and Alf’ = v, 


( 2 ) 


G(a;)da; j K, 

(41) 

(42) 


where the matrix K is given by K = JJ and J is given in (13). Moreover, solutions 
to ( 41 )-( 42 ) exist provided that Assumption^is satisfied. 


Proof of Theorem We begin by exhibiting the exponential martingale 
Let us denote H(s,x)N^^'>{ds,dx). For deterministic a{t,x) and 

f3{t, x), then by the classical exponential formula the following process is a martingale 


exp 





0 JR+ 


— 1 ) p(s)H{x)dxds 



- 1) (^(s) + (ZP.H.) + {Zf>, <!>.)) G(r)drdsj. 


(43) 


The aim now is to compute the joint Laplace transform of the processes and 

, since Tr(h.M) = Y.k,Fk,iMk,i. 


(‘ 2 '] 

Mf . This remains to compute E 


F'r(u.Mfi'’+v.Mfi( 
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Let us consider the two (deterministic) processes and with sizes (m + 
l)(g + 1) and (n + l)(p + 1) respectively. By integration by parts, d (= 


From (30), we get the dynamics 


dTr ( AX’Mr ) = 


/ Tr iV«(dt,dx) 

+ Tr (+ A^MfA dt 


Let us now use Equation (43) with a(t,x) = Tr and (5{t.,x) = 

Tr (^ArW^‘^\t,x)^ to get the martingale (39). 

To get the Laplace transform, it remains to make the random part of the integrant 


m 


(39) vanish. To do this, let us hrst identify the term in M^) to get the linear 


equation (41) for i = 1. In addition, the term in leads to (41) for i = 2. If we 


set terminal conditions (42), we get the Laplace transform (40) by the martingale 


property of (39) 


To conclude on the existence and uniqueness, we use Cauchy-Lipschitz theorem. 
To show that solution of class to (41) exist and is unique, it is sufficient to prove 
that the map (Y, t) h-)■ gTr(yw( is of class C^. Since the integrant is 

by Assumption (i) and (ii) , it is sufficient to prove that its gradient given by 


p(y w(2) {t,x))Y^ gTv(y w(2) O.^)) 


X 


(44) 


is locally bounded by some quantity that is independent of Y and f, and is inte- 
grable with respect to G. Let us use some localization argument, and define the set 
5(0, r) = {A real (u + 1) x (p + 1) matrix such that ||A||^ < r}, where r > 0 and 
, = max_i<i<„_i Y7iZ-i Now, for {Y,t) e 5(0, r) x [0,T] we get 


exp(Tr(FIT(2)(t^a;))) 

( n—l p—1 \ 

/ p-i 

< exp (n + 1) 


k=-l 


< exp I r(?7, + 1) max sup max Srix) 

fg[0,T] -l<fc<n-l 


where the last inequality uses that Y G 5(0, r). As for the first component of (44), 
Yi,k I < 'f' and for the second component we have 

dtW'Tjit^x) < (t)o\x) supigfo^ rp] |i;0+i)(t)|, this concludes the proof by the use of 
Assumptions 1^ and o 
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5.3 On the pathwise representation 


5.3 On the pathwise representation 


As for the standard Hawkes process, it is possible to give a pathwise representation 
of the Hawkes process with general immigrants and its nnderlying popnlation. To 
do this, let ns hrst extend the thinning constrnction in Section to point processes 
with marks. Poisson point measnres can be used, not only to represent counting 
processes, but also general random point measures on M_|_ x E, say r(ds,dj/) = 
X]n>i ^Tn,v„)(ds, d?/), where {E,£) is some measurable space. As for the Hawkes 
process, Tn is seen as the time at which an individual arrives in the population 
(immigration or birth). In addition, E represents the space of characteristics and 
the mark Yn refers to the characteristic inherited by the individual that arrived at 
time Tn- Let us construct a random point measure r(ds,d|/) with general intensity 
measure 7 (ds, dy) assuming that it admits a density: 7 (ds, dy) = 7 (s, y) ds y{dy). In 
this model, events occur with intensity s h->• f^^^j(s,x)p.(dx), and if a birth occurs 
at time T„, then the characteristics Vn of the newborn are drawn with distribution 

LeE'y(Tn,x)tj.{dx)- 

Let (5(ds, dy, d 6 *) be a Poisson point measure on IR+ xEx IR+ with intensity measure 
ds/i(d|/)d 6 *. Let us still denote the canonical hltration generated by Q, and 

introduce the predictable cr-held associated with We further assume 

that 'y{t,y) is P{E^) x T-measurable and also that J* J^j(s,y)dsp.(dy) < +oo a.s.. 
Now, dehne 

T{ds,dy) = f l[o,..y(^,y)]( 6 ')Q(ds,d 2 /,d 6 '). (45) 

M_|_ 


This clearly dehnes a point measure and the martingale property for Q ensures that 
the random point measure r(ds,dj/) has intensity measure 'y{s,y)dsy{dy). Such 
construction can be found in Massoulie (1998); we refer to this paper for more 
details. 


We are now ready to construct the two point measures and given 


m 


Equation (29). Let us introduce two independent Poisson point measures Q’^^^(dt, dx, d^) 
and ( 5 ^^^(dt, dx, d^) on the probability space (r 2 ,J-', P) (enlarged if necessary) with 
same intensity measure dsdxd 6 * on M_|_ x ]R_|_ x IR+. The hrst point measure is im¬ 
mediate to construct since its intensity does not depend on it. Indeed, one can 
dehne 

N^^\dt,dx)= [ l[o,p(p/f(„)]( 6 ')Q(^)(dt,dx,d 6 '). 


We emphasize that this is not an equation on since its intensity does not depend 
on itself. As for the second point process related to the Hawkes process, the 
intensity is given as a particular for of the process itself. Indeed, the intensity 
measure of the point measure N^'^\dt, dx) is given by XtG{x) where Xt can be written 
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using (26) as 


At =/i(t) + / — s, dx) + / d't(t — s, x)iV^^^(ds, dx). 

J(0,t) J(0,t) 

Then the point measure can be defined as the solution to the following equation: 

\dt,dx) = / 1 [o,(/,(*)+/ $t(i-s,a:)Ar(2)(ds,dx)+/(Q .^t(i-s,x)iV(i)(ds,dx))G(x)](^)^^ ^ (dt, dx, d^). 

M-|- ’ ’ 

Let us now give the pathwise representation of the corresponding populations. 


From Equation (27), it follows that 
Zf^^(da,dx)= [ [ 


(0,t] ./M+ xl 


l[o,p(s)ri-(x)](6')%-^,x)(da, dx)Q^^^(ds, dx, d^). (46) 


and 

Zt^^(da,dx) = 


’ (o,t] xi 


(47) 

Such representations are used in the field of stochastic population dynamics for 


populations with ages and/or characteristics (see in particular Fournier and Meleard 


(2004), Tran (2008) and Bensusan et ah (2010-2015)). As for the standard Hawkes 


process, the pathwise representation has many advantages. In particular, it allows 
to derive the full system of equations and to identify each generation (see Section 
1^. More importantly, this formulation makes the link between the Hawkes process 
literature and the field of stochastic population dynamics. To further investigate 
this link seems to be a promising direction for future research. 


Conclusion 

We introduced the concept of age pyramid for a class of Hawkes processes with 
general immigrants. The virtue of this approach is to keep track of all past events. 
This allows tractable computations for the Hawkes process with general immigrants 
whose fertility functions are time dependent generalizations of the popular expo¬ 
nential case, providing natural extensions of the existing results in this direction. 
In addition, we illustrated the pathwise construction of the Hawkes dynamics and 
its underlying population process. On the whole, our approach seems to reconcile 
two definitions of Hawkes processes, through an intensity process or a branching 
dynamics. This framework appears to be a promising direction for further research. 
As an example, the large population asymptotics in the field of measure-valued pop¬ 
ulation dynamics could give further insights on the macroscopic behavior of Hawkes 
processes. 
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Appendix 


Proof of Corollary Let us identify the terms in Equation ( [ItI ). Let us denote 
At = An-i{t)). The identification of the first component gives 


C-lAn-l{t) + — 0 . 

The second component leads to 

A(^) + c-oAn-iit) + — 1 = 0. 

As for 1 < fc < n — 1, we get 

Ak-i(t) + CkAn-i(t) + Aj^it) = 0. 


Recursive computation of (50) provides for 0 < fc < n — 1, 

n—l—k 

i-l-fc A (n-l-fc) 




i=i 


One deduces from (51) that 


n—1 




k=l 


(48) 


(49) 


(50) 


(51) 


(52) 


Let us intoduce the function G{t) = A„_i(s)ds and choose A_i{t) = 61 — C-iG{t) 
that satisifes (48). Now, put ( [5l| ) and ( [5^ into ( [4^ to get the following non-linear 
ordinary differential equation for G, 


n—1 / 

(_l)»^-iGO+i)(i) + + exp ( 01 - c_iG(f) 

k=^ 

n—2 

+ mn-iG'{t) + 

k=0 

(53) 


n—1 —fc 




l=l 


1 = 0 . 


Let us simplify the sum in the exponential. By changing variable k into n — l — k, 
it is equal to Ylk=i mn-i-k{-l)^G^^^^\t) + J 2 'i=ii-'^y^n-i-kCn-i-k+iG'^’'\t). 
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Then exchanging the sums leads to 

Finally, by setting /<(—/ + ! and exchanging notations k and I, (53) becomes 

n—1 


n—1 


(_l)-iG(-+b(t) + +exp 9, - c_iG'(t) + -1 = 0, 

k =0 V k =0 / 

( 54 ) 

where for 0 < < n - 1, = (-1)‘ - lOITe+i mn.-i-iCn-i+t)- 

Now, let us use (20) with (49) to get 

E [exp (n.Xr)] = exp |^-/i (v4o(t) + Co/l„_i(t))dt^ , 


= exp —p 


k =0 




= exp 




where the second equality comes from (52). Let us set 2 lo(T) = 62 and for 1 < k < 
n— 1 , Ak{T) = 0. One can show by (51) that the previous conditions are equivalent 
to the terminal values G^^\T) = (— 1 )"“^ 6*2 and for 1 < A; < n — 1 , = 0 . 

Note that by dehnition of G we also get G{T) = 0. We thus get 


E [exp (0iiVT + 92 .{Zt, m = exp |-p + (-1)"G(")(0) + ^(-l)"+'cfcG('^n0) ) 

This concludes the proof, o 


k =0 
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